RoboCop: Crime Classification and Prediction in San Francisco

نویسنده

  • John Cherian
چکیده

In this paper, we employ machine learning and other statistical techniques to the problems of classifying and predicting crimes in San Francisco. Drawing upon existing research in the field to approach these two problems, we employ Random Forest and VAR(p) models, respectively. For the classification problem, our results across all 39 crime categories demonstrate the difficulty of the fully-specified crime classification problem, as we achieve a maximum 39-way classification accuracy of 31.84%. Although our results are perhaps inappropriate for daily or weekly use in any police organization, the time series model performs adequately at forecasting crime incident averages in the coming weeks and months. With more data and the use of a time series model already developed by these authors for discrete time series, our results might be improved upon further.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Crime Prediction in San Fransisco

In June 2015, Kaggle began a competition named “San Francisco Crime Classification”[8], ending in June 2016. The competition’s dataset caught our attention due the subject being very tangible, with crime being at the forefront of modern media and to San Francisco being culturally significant due to its current tech industry. The dataset is also described by geographic and temporal features, the...

متن کامل

San Francisco Crime Classification

San Francisco Crime Classification is an online competition administered by Kaggle Inc. The competition aims at predicting the future crimes based on a given set of geographical and time-based features. In this paper, I achieved a an accuracy that ranks at top %18, as of May 19th, 2016. I will explore the data, and explain in details the tools I used to achieve that result.

متن کامل

CS 229 Project Report: San Francisco Crime Classification

Different machine learning approaches were conceptualized and implemented for predicting the probabilities of crime categories for crimes reported in San Francisco. The crimes records used in the research are downloaded from a competition on Kaggle. A Bayesian model, a mixture of Guassians model (stratified and unstratified), and logistic regression are implemented. A satisfactory result was ac...

متن کامل

San Francisco Crime Classification 2015

We aim to classify the type of crimes committed within San Francisco, given the time and location of a criminal occurrence. This study is important and beneficial. Using data mining approaches, we can predict the location, type and time of criminal occurrences in the city. We also explore some interesting questions, for example, if more crimes occur on certain days of the week or certain times ...

متن کامل

Low Molecular Weight Heparin Overdose: A 10 Year Case Series

  Background: Low molecular weight heparin (LMWH) is used for the treatment and prevention of coagulative disorders. Few patients receiving therapeutic doses of LMWH develop major hemorrhage. Currently there are few reports in the literature on acute overdose on adults.  In this study, clinical profile, treatment and outcome of 21 patients who acutely overdosed enoxaparin are described. Method...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015